We study the convergence of states under continuous-time depolarizing
channels with full rank fixed points in terms of the relative entropy. The
optimal exponent of an upper bound on the relative entropy in this case is
given by the log-Sobolev-1 constant. Our main result is the computation
of this constant. As an application we use the log-Sobolev-1 constant of
the depolarizing channels to improve the concavity inequality of the
von-Neumann entropy. This result is compared to similar bounds obtained
recently by Kim et al. and we show a version of Pinsker’s inequality, which
is optimal and tight if we fix the second argument of the relative entropy.
Finally, we consider the log-Sobolev-1 constant of tensor-powers of the
completely depolarizing channel and use a quantum version of Shearer’s
inequality to prove a uniform lower bound.


1. Introduction 

Let $\mathcal{M}_d$ denote the set of complex $d \times d$-matrices, $\mathcal{D}_d \subset \mathcal{M}_d$ the set of quantum states,
i.e. positive matrices with trace equal to 1, and $\mathcal{D}_d^+$ the set of strictly positive states.
The relative entropy (also called quantum Kullback-Leibler divergence) of $\rho, \sigma \in \mathcal{D}_d$
is defined as

\[ D(\rho\|\sigma) := \begin{cases} \mathrm{tr}[\rho(\log\rho - \log\sigma)], & \text{if } \mathrm{supp}(\rho) \subseteq \mathrm{supp}(\sigma) \\ +\infty, & \text{otherwise.} \end{cases} \tag{1} \]

The relative entropy defines a natural distance measure to study the convergence of
Markovian time-evolutions. For some state $\sigma \in \mathcal{D}_d$ consider the generalized
depolarizing Liouvillian $\mathcal{L}_\sigma : \mathcal{M}_d \to \mathcal{M}_d$ defined as

\[ \mathcal{L}_\sigma(\rho) := \mathrm{tr}[\rho]\,\sigma - \rho. \tag{2} \]

This Liouvillian generates the generalized depolarizing channel $T_t^\sigma : \mathcal{M}_d \to \mathcal{M}_d$ with
$T_t^\sigma(\rho) := e^{t\mathcal{L}_\sigma}(\rho) = (1 - e^{-t})\,\mathrm{tr}[\rho]\,\sigma + e^{-t}\rho$, where $t \in \mathbb{R}^+$ denotes a time parameter. As
$T_t^\sigma(\rho) \to \sigma$ for $t \to \infty$ we can study the convergence speed of the depolarizing channel
with a full rank fixed point $\sigma \in \mathcal{D}_d^+$ by determining the largest constant $\alpha \in \mathbb{R}^+$ such
that

\[ D(T_t^\sigma(\rho)\|\sigma) \leq e^{-2\alpha t} D(\rho\|\sigma) \tag{3} \]

holds for any $\rho \in \mathcal{D}_d$ and any $t \in \mathbb{R}^+$. This constant is known as the logarithmic
Sobolev-1 constant [?] of $\mathcal{L}_\sigma$, denoted by $\alpha_1(\mathcal{L}_\sigma)$. In the following we will compute
this constant and then use it to derive an improvement on the concavity of the
von-Neumann entropy.
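
As a quick numerical illustration of (3), the following minimal sketch restricts to states that commute with $\sigma$, so that only spectra (probability vectors) are needed. The spectra `sigma` and `rho` are hypothetical example values, and the exponent $\alpha = 1/2$ is an assumption of the sketch: it is the value that follows from convexity of the relative entropy alone, not the optimal constant computed later.

```python
import math

def rel_entropy(p, s):
    """Classical relative entropy D(p||s) in nats; s must have full support."""
    return sum(pi * math.log(pi / si) for pi, si in zip(p, s) if pi > 0)

def depolarize(p, s, t):
    """Depolarizing channel T_t acting on the spectrum of a state commuting with sigma."""
    w = math.exp(-t)
    return [(1 - w) * si + w * pi for pi, si in zip(p, s)]

sigma = [0.5, 0.3, 0.2]  # hypothetical spectrum of a full rank fixed point
rho = [0.1, 0.1, 0.8]    # hypothetical spectrum of a commuting initial state
alpha = 0.5              # assumption: exponent guaranteed by convexity alone

for t in (0.1, 0.5, 1.0, 3.0):
    lhs = rel_entropy(depolarize(rho, sigma, t), sigma)
    rhs = math.exp(-2 * alpha * t) * rel_entropy(rho, sigma)
    print(f"t={t}: D(T_t(rho)||sigma) = {lhs:.5f}, bound = {rhs:.5f}")
```

The printed left-hand sides stay below the bound for every sampled time; the log-Sobolev-1 constant quantifies how much faster than $e^{-t}$ the decay actually is.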


2. Preliminaries and notation 


Consider a primitive¹ Liouvillian $\mathcal{L}$ with full rank fixed point $\sigma \in \mathcal{D}_d^+$ and denote by
$T_t := e^{t\mathcal{L}}$ the quantum dynamical semigroup generated by $\mathcal{L}$. Consider the function
$f(t) := D(T_t(\rho)\|\sigma)$ for some initial state $\rho \in \mathcal{D}_d$ and note that if

\[ \frac{df}{dt} \leq -2\alpha f \]

holds for some $\alpha \in \mathbb{R}^+$, then it follows that $f(t) \leq e^{-2\alpha t} f(0)$. The time derivative of
the relative entropy at $t = 0$, also called the entropy production [3], is given by:

\[ \frac{d}{dt} D(T_t(\rho)\|\sigma)\Big|_{t=0} = -\mathrm{tr}[\mathcal{L}(\rho)(\log(\sigma) - \log(\rho))] \tag{4} \]

as $\mathrm{tr}(\mathcal{L}(\rho)) = 0$ for any $\rho \in \mathcal{D}_d$. This motivates the following definition:

Definition 2.1 (log-Sobolev-1 constant, [?]). For a primitive Liouvillian $\mathcal{L} : \mathcal{M}_d \to \mathcal{M}_d$
with full rank fixed point $\sigma \in \mathcal{D}_d^+$ we define its log-Sobolev-1 constant as

\[ \alpha_1(\mathcal{L}) := \sup\{\alpha \in \mathbb{R}^+ : \mathrm{tr}[\mathcal{L}(\rho)(\log(\sigma) - \log(\rho))] \geq 2\alpha D(\rho\|\sigma),\ \forall \rho \in \mathcal{D}_d^+\}. \tag{5} \]

¹ A Liouvillian is primitive if, and only if, it has a unique full rank fixed point $\sigma$ and for any $\rho \in \mathcal{D}_d$
we have $e^{t\mathcal{L}}(\rho) \to \sigma$ as $t \to \infty$.

For a primitive Liouvillian $\mathcal{L} : \mathcal{M}_d \to \mathcal{M}_d$ the preceding discussion shows that (3)
holds for any $\alpha \leq \alpha_1(\mathcal{L})$. Furthermore, $\alpha_1(\mathcal{L})$ is the optimal constant for which this
inequality holds independent of $\rho \in \mathcal{D}_d$ (for states $\rho$ not of full rank this follows from
a simple continuity argument).

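In the following we will need some functions defined as continuous extensions of
quotients of relative entropies. We denote by $Q_\sigma : \mathcal{D}_d^+ \to \mathbb{R}$ the continuous extension
of the function $\rho \mapsto \frac{D(\sigma\|\rho)}{D(\rho\|\sigma)}$ (see Appendix B) given by

\[ Q_\sigma(\rho) = \begin{cases} \frac{D(\sigma\|\rho)}{D(\rho\|\sigma)}, & \rho \neq \sigma \\ 1, & \rho = \sigma. \end{cases} \tag{6} \]

Note that for $x \in [0,1]$ and $y \in (0,1)$ the binary relative entropy is defined as

\[ D_2(x\|y) := x \log\left(\frac{x}{y}\right) + (1 - x) \log\left(\frac{1 - x}{1 - y}\right). \tag{7} \]

This is the classical relative entropy of the probability distributions $(x, 1 - x)$ and
$(y, 1 - y)$. For $y \in (0,1)$ we denote by $q_y : (0,1) \to \mathbb{R}$ the continuous extension of
$x \mapsto \frac{D_2(y\|x)}{D_2(x\|y)}$ given by

\[ q_y(x) = \begin{cases} \frac{D_2(y\|x)}{D_2(x\|y)}, & x \neq y \\ 1, & x = y. \end{cases} \tag{8} \]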
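
The definitions (7) and (8) translate directly into code. The following is a minimal sketch using natural logarithms; the function names `d2` and `q` are chosen here for illustration only.

```python
import math

def d2(x, y):
    """Binary relative entropy D_2(x||y) for x in [0,1] and y in (0,1), in nats."""
    total = 0.0
    if x > 0:
        total += x * math.log(x / y)
    if x < 1:
        total += (1 - x) * math.log((1 - x) / (1 - y))
    return total

def q(y, x):
    """q_y(x) = D_2(y||x) / D_2(x||y) for x, y in (0,1), extended by 1 at x = y."""
    if x == y:
        return 1.0
    return d2(y, x) / d2(x, y)

print(q(0.1, 0.5))        # quotient for y = 0.1, x = 0.5
print(q(0.3, 0.3 + 1e-3)) # close to 1 near the diagonal x = y
```

The second call illustrates why the extension by $q_y(y) = 1$ is the continuous one: both relative entropies vanish quadratically at $x = y$ with the same leading coefficient.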


3. Log-Sobolev-1 constant for the depolarizing Liouvillian

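Note that for the depolarizing Liouvillian $\mathcal{L}_\sigma$ with $\sigma \in \mathcal{D}_d^+$ as defined in (2) we have

\[ \mathrm{tr}[\mathcal{L}_\sigma(\rho)(\log(\sigma) - \log(\rho))] = D(\rho\|\sigma) + D(\sigma\|\rho). \]

Inserting this into Definition 2.1 we can write

\[ \alpha_1(\mathcal{L}_\sigma) = \inf_{\rho \in \mathcal{D}_d^+} \frac{1}{2}\left(1 + Q_\sigma(\rho)\right). \tag{9} \]

Our main result is the following theorem:

Theorem 3.1. Let $\mathcal{L}_\sigma : \mathcal{M}_d \to \mathcal{M}_d$ be the depolarizing Liouvillian with full rank fixed
point $\sigma \in \mathcal{D}_d^+$ as defined in (2). Then we have

\[ \alpha_1(\mathcal{L}_\sigma) = \min_{x \in [0,1]} \frac{1}{2}\left(1 + q_{s_{\min}(\sigma)}(x)\right), \]

where $s_{\min}(\sigma)$ denotes the minimal eigenvalue of $\sigma$.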
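
Theorem 3.1 reduces the computation of $\alpha_1(\mathcal{L}_\sigma)$ to a one-dimensional minimization. The sketch below approximates that minimum on a uniform grid; the grid size and the tolerance used to detect $x = s_{\min}(\sigma)$ are assumptions of the sketch, and a grid search only yields an upper approximation of the true minimum.

```python
import math

def d2(x, y):
    """Binary relative entropy D_2(x||y) in nats."""
    total = 0.0
    if x > 0:
        total += x * math.log(x / y)
    if x < 1:
        total += (1 - x) * math.log((1 - x) / (1 - y))
    return total

def q(y, x):
    """q_y(x) from (8); the value 1 is used when x is numerically equal to y."""
    if abs(x - y) < 1e-9:
        return 1.0
    return d2(y, x) / d2(x, y)

def alpha1(s_min, grid=2000):
    """Grid approximation of min over x in (0,1) of (1 + q_{s_min}(x)) / 2."""
    return min(0.5 * (1 + q(s_min, k / grid)) for k in range(1, grid))

for s in (0.01, 0.1, 0.25, 0.5):
    print(f"s_min = {s}: alpha_1 is approximately {alpha1(s):.4f}")
```

Only the smallest eigenvalue of $\sigma$ enters, so the same routine covers every dimension $d$.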
Figure 1: $\alpha_1(\mathcal{L}_\sigma)$ for $s_{\min}(\sigma) \in [0,1]$.

In Figure 1 the values of $\alpha_1(\mathcal{L}_\sigma)$ depending on $s_{\min}(\sigma) \in [0,1]$ are plotted. Note that
by Theorem 3.1 we have $\alpha_1(\mathcal{L}_\sigma) \to 1/2$ in the limit $s_{\min}(\sigma) \to 0$ (as $D_2(s_{\min}(\sigma)\|x) \to 0$
and $D_2(x\|s_{\min}(\sigma)) \to \infty$ in this case).

Before we state the proof of Theorem 3.1 we need to make a technical comment. By
(6) we have $Q_\sigma(\rho) \to +\infty$ as $\rho \to \partial\mathcal{D}_d$, i.e. as $\rho$ converges to a rank-deficient state.
Therefore, the infimum in (9) will be attained in a full rank state $\rho \in \mathcal{D}_d^+$ and we can
restrict the optimization to the compact set $K_\sigma \subset \mathcal{D}_d^+$ (depending on $\sigma$) defined as

\[ K_\sigma = \{\rho' \in \mathcal{D}_d : s_{\min}(\rho') \geq s_{\min}(\rho) - \epsilon\} \tag{10} \]

for some fixed $\epsilon \in (0, s_{\min}(\rho))$ and where $s_{\min}(\cdot)$ denotes the minimal eigenvalue. Note
that the minimizing state $\rho$ is contained in the interior of $K_\sigma$. Now we have to solve
the following optimization problem for fixed $\sigma \in \mathcal{D}_d^+$:

\[ \inf_{\rho \in \mathcal{D}_d^+} Q_\sigma(\rho) = \inf_{\rho \in K_\sigma} Q_\sigma(\rho). \tag{11} \]


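To prove Theorem 3.1 we will need the following lemma showing that the infimum
in (11) is attained at states $\rho \in \mathcal{D}_d^+$ commuting with the fixed point $\sigma$.

Lemma 3.1. For any $\sigma \in \mathcal{D}_d^+$ we have

\[ \inf_{\rho \in K_\sigma} Q_\sigma(\rho) = \inf_{\rho \in K_\sigma,\, [\rho,\sigma] = 0} Q_\sigma(\rho) \]

where $Q_\sigma : \mathcal{D}_d^+ \to \mathbb{R}$ denotes the continuous extension of $\rho \mapsto \frac{D(\sigma\|\rho)}{D(\rho\|\sigma)}$ (see (6)).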
Proof. Consider the spectral decomposition $\sigma = \sum_{i=1}^d s_i |v_i\rangle\langle v_i|$ for $s \in \mathbb{R}^d_+$ and fix a
vector $r \in \mathbb{R}^d_+$ which is not a permutation of $s$ and which fulfills $\min_i(r_i) \geq s_{\min}(\rho) - \epsilon$
(see (10)) and $\sum_{i=1}^d r_i = 1$. For some fixed orthonormal basis $\{|w_j\rangle\}_j$ consider
$\rho := \sum_{j=1}^d r_j |w_j\rangle\langle w_j| \in K_\sigma$. Inserting $\rho$ into $Q_\sigma$ gives:

\[ Q_\sigma(\rho) = \frac{D(\sigma\|\rho)}{D(\rho\|\sigma)} = \frac{-S(\sigma) - \mathrm{tr}[\sigma\log(\rho)]}{-S(\rho) - \mathrm{tr}[\rho\log(\sigma)]} = \frac{-S(\sigma) - \langle s, P\log(r)\rangle}{-S(\rho) - \langle \log(s), P r\rangle} =: F(P) \tag{12} \]

where we introduced $P \in \mathcal{M}_d$ given by $P_{ij} = |\langle v_i | w_j\rangle|^2$ and $\log(s), \log(r) \in \mathbb{R}^d$ are
defined as $(\log(s))_i = \log(s_i)$ and $(\log(r))_i = \log(r_i)$. Note that $P$ is a unistochastic
matrix, i.e. a doubly stochastic matrix whose entries are squares of absolute values of
the entries of a unitary matrix. We will show that the minimum of $F$ over unistochastic
matrices $P$ is attained at a permutation matrix. By definition of $P$ this shows that
there exists a state $\rho' \in \mathcal{D}_d^+$ with spectrum $r$ and commuting with $\sigma$, which fulfills

\[ Q_\sigma(\rho') \leq Q_\sigma(\rho). \]

As the set of unistochastic matrices is in general not convex [1], we want to consider
the set of doubly stochastic matrices instead. By Birkhoff’s theorem [3, Theorem
II.2.3] we can write any doubly stochastic $D \in \mathcal{M}_d$ as $D = \sum_{i=1}^k \lambda_i P_i$ for some $k \in \mathbb{N}$,
numbers $\lambda_i \in [0,1]$ with $\sum_{i=1}^k \lambda_i = 1$ and permutation matrices $P_i$. Now we can write
the denominator of $F(D)$ as

\[ -S(\rho) - \langle \log(s), D r\rangle = \sum_{i=1}^k \lambda_i \left(-S(\rho) - \langle \log(s), P_i r\rangle\right) = \sum_{i=1}^k \lambda_i D(\rho_i\|\sigma) > 0 \]

where $\rho_i$ is the state obtained by permuting the eigenvectors of $\rho$ with $P_i$. In the last
step we used Klein’s inequality [6, p. 511] together with the fact that $\rho_i \neq \sigma$ for any
$1 \leq i \leq k$ as their spectra are different. The previous estimate shows that $F$ is also
well-defined on doubly stochastic matrices.

Any unistochastic matrix is also doubly stochastic and we have

\[ \inf\{F(P) : P \in \mathcal{M}_d \text{ doubly stochastic}\} \leq \inf\{F(P) : P \in \mathcal{M}_d \text{ unistochastic}\}. \]

Note that $S(\sigma)$ and $S(\rho)$ in (12) only depend on $s \in \mathbb{R}^d_+$ and $r \in \mathbb{R}^d_+$ and thus the
numerator and the denominator of $F$ are positive affine functions in $P$. This shows
that $F$ is a quasi-linear function [?, p. 91] on the set of doubly stochastic matrices.
It can be shown (see [?]) that the minimum of such a function over a compact and
convex set is always attained in an extremal point of the set. By Birkhoff’s theorem [3,
Theorem II.2.3] the extremal points of the compact and convex set of doubly stochastic
matrices are the permutation matrices. As these are also unistochastic matrices we
have

\[ \inf\{F(P) : P \in \mathcal{M}_d \text{ unistochastic}\} = \inf\{F(P) : P \in \mathcal{M}_d \text{ permutation matrix}\}. \]

This finishes the first part.
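
The first part of the proof can be checked numerically in small dimension. The sketch below evaluates the quotient $F$ from (12) on random convex combinations of permutation matrices and compares the result with the minimum over permutation matrices. The spectra `s` and `r`, the dimension $d = 3$ and the sampling scheme are hypothetical choices made for this illustration.

```python
import itertools
import math
import random

def entropy(v):
    """Shannon entropy of a probability vector, in nats."""
    return -sum(x * math.log(x) for x in v if x > 0)

def F(P, s, r):
    """Quotient from (12): (-S(s) - <s, P log r>) / (-S(r) - <log s, P r>)."""
    d = len(s)
    P_log_r = [sum(P[i][j] * math.log(r[j]) for j in range(d)) for i in range(d)]
    P_r = [sum(P[i][j] * r[j] for j in range(d)) for i in range(d)]
    numerator = -entropy(s) - sum(s[i] * P_log_r[i] for i in range(d))
    denominator = -entropy(r) - sum(math.log(s[i]) * P_r[i] for i in range(d))
    return numerator / denominator

def permutation_matrices(d):
    for perm in itertools.permutations(range(d)):
        yield [[1.0 if perm[i] == j else 0.0 for j in range(d)] for i in range(d)]

def random_doubly_stochastic(d, rng):
    """Random convex combination of all permutation matrices (Birkhoff)."""
    perms = list(permutation_matrices(d))
    weights = [rng.random() for _ in perms]
    total = sum(weights)
    return [[sum(w / total * P[i][j] for w, P in zip(weights, perms))
             for j in range(d)] for i in range(d)]

s = [0.5, 0.3, 0.2]  # hypothetical spectrum of sigma
r = [0.6, 0.3, 0.1]  # hypothetical spectrum of rho, not a permutation of s
best_perm = min(F(P, s, r) for P in permutation_matrices(3))
rng = random.Random(0)
samples = [F(random_doubly_stochastic(3, rng), s, r) for _ in range(200)]
print(best_perm, min(samples))
```

No sampled doubly stochastic matrix falls below the best permutation matrix, in line with the quasi-linearity argument.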







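To prove the lemma note that we have

\[ \inf_{\rho' \in K_\sigma} Q_\sigma(\rho') = Q_\sigma(\rho) \]

for some minimizing full rank state $\rho \in \mathcal{D}_d^+$. Now consider some sequence $(\rho_n)_{n \in \mathbb{N}} \in K_\sigma^{\mathbb{N}}$
with $\rho_n \to \rho$ as $n \to \infty$ and such that the spectra of the $\rho_n$ are no permutations
of the spectrum of $\sigma$. By the first part of the proof we find a sequence $(\rho'_n)_{n \in \mathbb{N}} \in K_\sigma^{\mathbb{N}}$
commuting with $\sigma$, such that

\[ Q_\sigma(\rho) \leq Q_\sigma(\rho'_n) \leq Q_\sigma(\rho_n) \to Q_\sigma(\rho) \]

as $n \to \infty$. Thus $Q_\sigma(\rho'_n) \to Q_\sigma(\rho)$ as $n \to \infty$. On the compact set $K_\sigma$ the sequence
$(\rho'_n)_n$ has a converging subsequence $(\rho'_{n_k})_k$ with $\rho'_{n_k} \to \rho' \in K_\sigma$ as $k \to \infty$. By
continuity of $Q_\sigma$ we have $Q_\sigma(\rho') = Q_\sigma(\rho) = \inf_{\rho \in K_\sigma} Q_\sigma(\rho)$ and by continuity of the
commutator $\rho \mapsto [\rho, \sigma]$ we have $[\rho', \sigma] = 0$.

□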


With this lemma we can prove our main result: 


Proof of Theorem 3.1. By Lemma 3.1 we may restrict the optimization in (11) to
states which commute with $\sigma$. Thus, we can repeat the construction of the compact
set $K_\sigma$ (see (10)) for a minimizer $\rho \in \mathcal{D}_d^+$ with $[\rho, \sigma] = 0$. By construction $\rho$ lies
in the interior of $K_\sigma$, which will be important for the following argument involving
Lagrange-multipliers.

To find necessary conditions on the minimizers of (11) we abbreviate $C := \inf_{\rho \in K_\sigma} Q_\sigma(\rho)$
and note that $C > 0$. To see this, note that we may extend $Q_\sigma(\rho)$ continuously to
1 at $\sigma$, so there exists $\delta > 0$ s.t. for $\|\rho - \sigma\|_1 \leq \delta$ we have $Q_\sigma(\rho) \geq \frac{1}{2}$ and for $\rho$
s.t. $\|\rho - \sigma\|_1 > \delta$ we have $Q_\sigma(\rho) \geq \frac{\delta^2}{2\log(s_{\min}^{-1}(\sigma))}$ using Pinsker’s inequality and
$D(\rho\|\sigma) \leq \log(s_{\min}^{-1}(\sigma))$. For any $\rho \in K_\sigma$ with $[\rho, \sigma] = 0$ and $\rho \neq \sigma$ we have

\[ \frac{D(\sigma\|\rho)}{D(\rho\|\sigma)} \geq C \]

which is equivalent to

\[ S(\sigma) \leq C S(\rho) + C \sum_{i=1}^d r_i \log(s_i) - \sum_{i=1}^d s_i \log(r_i). \tag{13} \]

Here $\{r_i\}_{i=1}^d$ denote the eigenvalues of $\rho \in K_\sigma$ (see (10)) fulfilling $[\rho, \sigma] = 0$ and $\{s_i\}_{i=1}^d$
the eigenvalues of $\sigma$. As $\rho$ is a minimizer of (11) and commutes with $\sigma$ its spectrum
is a minimizer of the right-hand-side of (13) minimized over the set $S := \{r \in \mathbb{R}^d_+ :
\min_i(r_i) \geq s_{\min}(\rho) - \epsilon\} \subset \mathbb{R}^d$ with $\epsilon$ chosen in the construction of $K_\sigma$ (see (10)). We
will now compute necessary conditions on the spectrum of $\rho$ using the formalism of
Lagrange-multipliers (note that by construction the spectrum of $\rho$ lies in the interior
of $S$).
Consider the Lagrange function $F : S \times \mathbb{R} \to \mathbb{R}$ given by

\[ F(r_1, \ldots, r_d, \lambda) = C S(\rho) + C \sum_{i=1}^d r_i \log(s_i) - \sum_{i=1}^d s_i \log(r_i) + \lambda\left(\sum_{i=1}^d r_i - 1\right). \]

The gradient of $F$ is given by:

\[ [\nabla F(r_1, \ldots, r_d, \lambda)]_j = \begin{cases} C(-\log(r_j) - 1 + \log(s_j)) - \frac{s_j}{r_j} + \lambda, & 1 \leq j \leq d \\ \sum_{i=1}^d r_i - 1, & j = d+1. \end{cases} \tag{14} \]

By the formalism of Lagrange-multipliers any minimizer $r = (r_1, \ldots, r_d)$ of the
right-hand-side of (13) in the interior of $S$ has to fulfill $\nabla F(r_1, \ldots, r_d, \lambda) = 0$ for some $\lambda \in \mathbb{R}$.
Summing up the first $d$ of these equations (where the $j$th equation is multiplied with
$r_j$) implies

\[ \lambda = 1 + C(1 + D(\rho\|\sigma)). \]

Inserting this back into the equations $[\nabla F(r_1, \ldots, r_d, \lambda)]_j = 0$ and using $u_j = \frac{r_j}{s_j}$ we
obtain

\[ u_j(1 + C D(\rho\|\sigma)) - 1 = C u_j \log(u_j) \tag{15} \]

for $1 \leq j \leq d$. For fixed $D(\rho\|\sigma)$ there are only two values for $u_j$ solving the equations
(15), as an affine function (the left-hand-side) can only intersect a strictly convex
function (the right-hand-side) in at most two points. Thus, for a minimizer $\{r_i\}_{i=1}^d$ of
the right-hand-side of (13) in the interior of $S$ there are constants $c_1, c_2 \in \mathbb{R}^+$ such
that for each $i \in \{1, \ldots, d\}$ either $r_i = c_1 s_i$ or $r_i = c_2 s_i$ holds.

We have obtained the following conditions on the spectrum of the minimizer $\rho \in K_\sigma$
(fulfilling $[\rho, \sigma] = 0$) of (11): There exist constants $c_1, c_2 \in \mathbb{R}^+$, a permutation $\nu \in S_d$
(where $S_d$ denotes the group of permutations on $\{1, \ldots, d\}$) and some $0 \leq n \leq d$ such
that the spectrum $r \in \mathbb{R}^d_+$ of $\rho$ fulfills $r_{\nu(i)} = c_1 s_{\nu(i)}$ for any $1 \leq i \leq n$ and $r_{\nu(i)} = c_2 s_{\nu(i)}$ for
any $n + 1 \leq i \leq d$. Note that the cases $c_1 = c_2 = 1$, $n = 0$ and $n = d$ all correspond
to the case $\rho = \sigma$ where we have $Q_\sigma(\sigma) = 1$. Thus, we can exclude the cases $n = 0$
and $n = d$ as long as we optimize over $c_1 = c_2 = 1$. Furthermore, note that we can
use the normalization of $\rho$, i.e. $c_1 \sum_{i=1}^n s_{\nu(i)} + c_2 \sum_{i=n+1}^d s_{\nu(i)} = 1$, to eliminate $c_2$. Given a
permutation $\nu \in S_d$ and $n \in \{1, \ldots, d\}$, we define $p(\nu, n) = \sum_{i=1}^n s_{\nu(i)}$. Inserting the
above conditions into (11) and setting $c_1 = x$ and $0 < n < d$ yields

\[ \inf_{\rho \in K_\sigma} Q_\sigma(\rho) = \inf_{\nu \in S_d}\ \inf_{1 \leq n < d}\ \inf_{x \in [0, 1/p(\nu,n)]} q_{p(\nu,n)}(x\, p(\nu,n)) \tag{16} \]

\[ = \inf_{\nu \in S_d}\ \inf_{1 \leq n < d}\ \inf_{x \in [0,1]} q_{p(\nu,n)}(x) \tag{17} \]

where $q_y : [0,1] \to \mathbb{R}$ denotes the continuous extension of $x \mapsto \frac{D_2(y\|x)}{D_2(x\|y)}$ (see (8)). By
Lemma A.1 in the appendix the function $y \mapsto q_y(x)$ is continuous and quasi-concave
and hence the minimum over any convex and compact set is attained at the boundary.
Thus, we have

\[ q_{p(\nu,n)}(x) \geq \inf_{\nu \in S_d}\ \inf_{1 \leq n < d} q_{p(\nu,n)}(x) \geq \inf_{y \in [s_{\min}(\sigma),\, 1 - s_{\min}(\sigma)]} q_y(x) = q_{s_{\min}(\sigma)}(x) \]

using $q_{1 - s_{\min}(\sigma)}(1 - x) = q_{s_{\min}(\sigma)}(x)$ for any $x \in [0,1]$. Inserting this into (17) leads to

\[ \inf_{\rho \in K_\sigma} Q_\sigma(\rho) = \inf_{x \in [0,1]} q_{s_{\min}(\sigma)}(x). \]

□

Lemma 3.1 implies that the log-Sobolev-1 constant of the depolarizing channels
coincides with the classical one of the random walk on the complete graph with $d$
vertices and distribution given by the spectrum of $\sigma$. This constant has been shown to
imply other inequalities, such as in [5, Proposition 3.13]. Using this result, Theorem
3.1 implies a refined transportation inequality on graphs.

Using the correspondence with the classical log-Sobolev-1 constant of a random walk
on the complete graph, we may apply [9, Example 3.10], which proves:

Corollary 3.1. Let $\mathcal{L}_\sigma : \mathcal{M}_d \to \mathcal{M}_d$ be the depolarizing Liouvillian with full rank
fixed point $\sigma \in \mathcal{D}_d^+$ as defined in (2). Then we have

\[ \alpha_1(\mathcal{L}_\sigma) \geq \frac{1}{2} + \sqrt{s_{\min}(\sigma)\,(1 - s_{\min}(\sigma))} \]

with equality iff $s_{\min}(\sigma) = \frac{1}{2}$. Again $s_{\min}(\sigma)$ denotes the minimal eigenvalue of $\sigma$.
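
The lower bound of Corollary 3.1 can be compared with a numerical value of the constant from Theorem 3.1. The sketch below is a minimal illustration; the grid-based minimization and the sampled values of $s_{\min}(\sigma)$ are assumptions made here, and the helper names are not taken from the text.

```python
import math

def d2(x, y):
    """Binary relative entropy D_2(x||y) in nats."""
    total = 0.0
    if x > 0:
        total += x * math.log(x / y)
    if x < 1:
        total += (1 - x) * math.log((1 - x) / (1 - y))
    return total

def alpha1(s_min, grid=2000):
    """Grid approximation of min over x of (1 + D_2(s_min||x) / D_2(x||s_min)) / 2."""
    best = 1.0  # value of the continuous extension at x = s_min
    for k in range(1, grid):
        x = k / grid
        if abs(x - s_min) > 1e-9:
            best = min(best, 0.5 * (1 + d2(s_min, x) / d2(x, s_min)))
    return best

def lower_bound(s_min):
    """Right-hand side of Corollary 3.1."""
    return 0.5 + math.sqrt(s_min * (1 - s_min))

for s in (0.01, 0.1, 0.25, 0.4, 0.5):
    print(f"s_min = {s}: alpha_1 ~ {alpha1(s):.4f}, lower bound = {lower_bound(s):.4f}")
```

The two columns agree at $s_{\min}(\sigma) = 1/2$ and separate as the smallest eigenvalue decreases.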

4. Application: Improved concavity of von-Neumann entropy

It is a well-known fact that the von-Neumann entropy $S(\rho) = -\mathrm{tr}[\rho\log(\rho)]$ is concave
in $\rho$. Using Theorem 3.1 we can improve the concavity inequality:

Theorem 4.1 (Improved concavity of the von-Neumann entropy). For $\rho, \sigma \in \mathcal{D}_d$ and
$q \in [0,1]$ we have

\[ S((1 - q)\sigma + q\rho) - (1 - q)S(\sigma) - qS(\rho) \geq \max\left\{ q\left(1 - q^{c(\sigma)}\right) D(\rho\|\sigma),\ (1 - q)\left(1 - (1 - q)^{c(\rho)}\right) D(\sigma\|\rho) \right\} \]

with

\[ c(\sigma) = \min_{x \in [0,1]} \frac{D_2(s_{\min}(\sigma)\|x)}{D_2(x\|s_{\min}(\sigma))} \]

and $c(\rho)$ defined in the same way.

Note that this bound becomes trivial if both $\sigma$ and $\rho$ are not of full rank (as we
have $c(\rho) = c(\sigma) = 0$ in this case). However, as long as $D(\rho\|\sigma)$ or $D(\sigma\|\rho) < \infty$, we
may still get a bound by restricting both density matrices to the support of $\sigma$ or $\rho$,
respectively.
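
For commuting states the bound of Theorem 4.1 only involves spectra and can be evaluated directly. The following sketch compares the concavity gap with the bound; the spectra are hypothetical examples, and the constants $c(\sigma)$ and $c(\rho)$ are approximated on a grid, which is an assumption of the sketch rather than an exact evaluation.

```python
import math

def entropy(p):
    return -sum(x * math.log(x) for x in p if x > 0)

def rel_entropy(p, s):
    return sum(pi * math.log(pi / si) for pi, si in zip(p, s) if pi > 0)

def d2(x, y):
    total = 0.0
    if x > 0:
        total += x * math.log(x / y)
    if x < 1:
        total += (1 - x) * math.log((1 - x) / (1 - y))
    return total

def c_const(s_min, grid=2000):
    """Grid approximation of c = min over x of D_2(s_min||x) / D_2(x||s_min)."""
    best = 1.0  # value of the continuous extension at x = s_min
    for k in range(1, grid):
        x = k / grid
        if abs(x - s_min) > 1e-9:
            best = min(best, d2(s_min, x) / d2(x, s_min))
    return best

def concavity_gap(sigma, rho, q):
    mix = [(1 - q) * s + q * r for s, r in zip(sigma, rho)]
    return entropy(mix) - (1 - q) * entropy(sigma) - q * entropy(rho)

def improved_bound(sigma, rho, q):
    first = q * (1 - q ** c_const(min(sigma))) * rel_entropy(rho, sigma)
    second = (1 - q) * (1 - (1 - q) ** c_const(min(rho))) * rel_entropy(sigma, rho)
    return max(first, second)

sigma = [0.5, 0.3, 0.2]  # hypothetical spectrum of sigma
rho = [0.1, 0.1, 0.8]    # hypothetical spectrum of a commuting state rho
for q in (0.1, 0.3, 0.5, 0.7, 0.9):
    print(q, concavity_gap(sigma, rho, q), improved_bound(sigma, rho, q))
```

In this example the bound follows the exact gap most closely for $q$ near 0 and 1.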
Proof. Note that for the Liouvillian $\mathcal{L} := -\log(q)\mathcal{L}_\sigma$ we have:

\[ e^{\mathcal{L}}(\rho) = q\rho + (1 - q)\sigma. \]

By Theorem 3.1 and (3) we have

\[ D(e^{\mathcal{L}}(\rho)\|\sigma) \leq q^{1 + c(\sigma)} D(\rho\|\sigma). \tag{18} \]

Rearranging and expanding the terms in (18) we get

\[ S(q\rho + (1 - q)\sigma) \geq (1 - q)S(\sigma) - q\,\mathrm{tr}[\rho\log(\sigma)] - q^{1 + c(\sigma)} D(\rho\|\sigma) \]

\[ = (1 - q)S(\sigma) + qS(\rho) + q\left(1 - q^{c(\sigma)}\right) D(\rho\|\sigma). \]

Interchanging the roles of $\rho$ and $\sigma$ in the above proof gives the second case under the
maximum.

□

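In [?] another improvement on the concavity of the von-Neumann entropy is shown:

\[ S((1 - q)\sigma + q\rho) - (1 - q)S(\sigma) - qS(\rho) \geq \max\left\{ \frac{q(1 - q)}{(1 - 2q)^2} D(\rho_{\mathrm{avg}}\|\rho_{\mathrm{rev}}),\ \frac{q(1 - q)}{(1 - 2q)^2} D(\rho_{\mathrm{rev}}\|\rho_{\mathrm{avg}}) \right\} \tag{19} \]

\[ \geq \frac{1}{2} q(1 - q) \|\rho - \sigma\|_1^2 \tag{20} \]

where $\rho_{\mathrm{avg}} = (1 - q)\sigma + q\rho$ and $\rho_{\mathrm{rev}} = (1 - q)\rho + q\sigma$. Note that this bound is valid for
all states $\rho, \sigma \in \mathcal{D}_d$ while our bound in Theorem 4.1 becomes trivial unless the support of
$\rho$ is contained in the support of $\sigma$ or the other way around. We will therefore consider
only full rank states in the following analysis.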

By simple numerical experiments our bound from Theorem 4.1 seems to be worse
than (19). However, one can argue that (19) is not much simpler than the
left-hand-side itself. In particular the dependence on $\rho$ and $\sigma$ is only implicit via the relative
entropy between $\rho_{\mathrm{avg}}$ and $\rho_{\mathrm{rev}}$. Our bound from Theorem 4.1 depends on some spectral
data (in terms of the smallest eigenvalues of $\rho$ or $\sigma$), but whenever this is given, we
have a bound for any $q \in [0,1]$ in terms of the relative entropies of $\rho$ and $\sigma$.

Again we can do simple numerical experiments to compare the bounds (20) and
Theorem 4.1. Recall that our bound is given in terms of the relative entropy and
(20) in terms of the trace norm. In Figure 2 the bounds are compared for randomly
generated quantum states in dimension $d = 10$. These plots show that the bounds are
not comparable and, depending on the choice of the states, the bound from Theorem
4.1 will perform better than (20) or vice versa. Note that for $q$ close to 0 or 1 our
bound seems to perform better in both panels of Figure 2. This is to be expected as $\alpha_1(\mathcal{L}_\sigma)$
is defined as the optimal constant $\alpha$ bounding the entropy production (4) (in $t = 0$)
by $-2\alpha D(\rho\|\sigma)$. Therefore, Theorem 4.1 should be the optimal bound (in terms of
relative entropy) for $q$ near 0 or 1.

Figure 2: Comparison of bound (20) (red) and the bound from Theorem 4.1 (blue,
where both choices of the ordering of $\rho$ and $\sigma$ are plotted) and the exact
value $S((1 - q)\sigma + q\rho) - (1 - q)S(\sigma) - qS(\rho)$ (black) for two pairs (panels (a) and (b)) of randomly
generated $10 \times 10$ quantum states and $q \in [0,1]$.

Note that by applying Pinsker’s inequality

\[ D(\rho\|\sigma) \geq \frac{1}{2}\|\rho - \sigma\|_1^2 \tag{21} \]

for states $\rho, \sigma \in \mathcal{D}_d$ to our bound from Theorem 4.1 we can obtain an improvement on
the concavity inequality in terms of the trace-distance similar to (20). Unfortunately
a simple computation shows that the resulting trace-norm bound is always worse than
(20). In the next section we will show that Pinsker’s inequality can be improved in
the case where the second argument in the relative entropy is fixed (which is the case
in the bound from Theorem 4.1). This will lead to an additional improvement of the
trace-norm bound obtained from Theorem 4.1 such that in some (but only very few)
cases the bound becomes better than (20).


5. State-Dependent Optimal Pinsker’s Inequality 

Pinsker’s inequality (21) can be applied to the bound in Theorem 4.1 to get an
improvement of the concavity in terms of the trace distance of the two density matrices.
It can also be applied to (3) to get a mixing time bound [5] for the depolarizing
channel. Note that in both of these cases the second argument of the relative entropy is
fixed. Other improvements have been considered in the literature [?], but here we will
improve Pinsker’s inequality in terms of the second argument of the relative entropy.
More specifically we compute the optimal constant $C(\sigma)$ (depending on $\sigma$) such that
$D(\rho\|\sigma) \geq C(\sigma)\|\rho - \sigma\|_1^2$ holds when $\sigma$ has full rank.

We will follow a strategy similar to the one pursued in [?] in proving this, where
the analogous problem was considered for classical probability distributions. For a
state $\rho \in \mathcal{D}_d$ let $s(\rho) = (s_1(\rho), \ldots, s_d(\rho))$ denote its vector of eigenvalues decreasingly
ordered.
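Lemma 5.1. Let $\sigma \in \mathcal{D}_d^+$ and for $A \subseteq \{1, \ldots, d\}$ define $P_\sigma(A) = \sum_{i \in A} s_i(\sigma)$. Then we
have for $\epsilon > 0$:

\[ \min_{\rho : \|\rho - \sigma\|_1 \geq \epsilon} D(\rho\|\sigma) = \min_{A \subseteq \{1, \ldots, d\}} D_2\left(P_\sigma(A) + \frac{\epsilon}{2} \,\Big\|\, P_\sigma(A)\right). \]

Proof. Let $\rho \in \mathcal{D}_d$ be such that $\|\rho - \sigma\|_1 = \delta$, with $\delta \geq \epsilon$. By Lidskii’s theorem [?,
Corollary III.4.2], we have:

\[ s(\rho - \sigma) = s(\sigma) - L s(\rho), \]

where $L$ is a doubly stochastic matrix. Define $\rho'$ to be the state which has eigenvalues
$L s(\rho)$ and commutes with $\sigma$. Then we have:

\[ \|\rho - \sigma\|_1 = \|\rho' - \sigma\|_1. \]

By the operational interpretation for the 1-norm [?, Theorem 9.1] there exist hermitian
projections $Q, Q' \in \mathcal{M}_d$ such that

\[ 2\,\mathrm{tr}[Q(\rho - \sigma)] = \|\rho - \sigma\|_1 = \|\rho' - \sigma\|_1 = 2\,\mathrm{tr}[Q'(\rho' - \sigma)]. \tag{22} \]

Now define the quantum channel $T : \mathcal{M}_d \to \mathcal{M}_2$ given by:

\[ T(\rho) = \mathrm{tr}[Q\rho]\,|0\rangle\langle 0| + \mathrm{tr}[(\mathbb{1} - Q)\rho]\,|1\rangle\langle 1|, \]

where $|0\rangle, |1\rangle$ is an orthonormal basis of $\mathbb{C}^2$. By the data processing inequality we
have:

\[ D(\rho\|\sigma) \geq D(T(\rho)\|T(\sigma)). \tag{23} \]

It is easy to see that the image of $Q'$ must be spanned by eigenvectors of $\sigma$. Thus, we
may associate a subset $A \subseteq \{1, \ldots, d\}$ to the projector $Q'$ indicating the eigenvectors
of $\sigma$ spanning this subspace. Using (22) and the assumption that $\|\rho - \sigma\|_1 = \delta$ we
have:

\[ \mathrm{tr}[Q'\rho'] = P_\sigma(A) + \frac{\delta}{2}. \]

Also observe that

\[ D(T(\rho)\|T(\sigma)) = D_2\left(P_\sigma(A) + \frac{\delta}{2} \,\Big\|\, P_\sigma(A)\right) \geq D_2\left(P_\sigma(A) + \frac{\epsilon}{2} \,\Big\|\, P_\sigma(A)\right) \]

as the binary relative entropy is convex and $\delta \geq \epsilon$ was assumed. With (23) we have:

\[ \min_{\rho : \|\rho - \sigma\|_1 \geq \epsilon} D(\rho\|\sigma) \geq \min_{A \subseteq \{1, \ldots, d\}} D_2\left(P_\sigma(A) + \frac{\epsilon}{2} \,\Big\|\, P_\sigma(A)\right). \tag{24} \]

Now given any $A \subseteq \{1, \ldots, d\}$ such that $P_\sigma(A) + \frac{\epsilon}{2} \leq 1$ (otherwise $D_2(P_\sigma(A) + \frac{\epsilon}{2}\|P_\sigma(A)) = +\infty$),
define a state $\tau \in \mathcal{D}_d$ which commutes with $\sigma$ and has spectrum:

\[ s_i(\tau) = \begin{cases} \frac{(P_\sigma(A) + \epsilon/2)\, s_i(\sigma)}{P_\sigma(A)}, & \text{for } i \in A \\ \frac{(1 - P_\sigma(A) - \epsilon/2)\, s_i(\sigma)}{1 - P_\sigma(A)}, & \text{else.} \end{cases} \]

Note that $\|\sigma - \tau\|_1 = \epsilon$ and $D(\tau\|\sigma) = D_2(P_\sigma(A) + \frac{\epsilon}{2}\|P_\sigma(A))$, i.e. the lower bound
in (24) is attained.

□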
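
The state $\tau$ constructed at the end of the proof is easy to check numerically, since it commutes with $\sigma$ and only its spectrum matters. The sketch below uses a hypothetical spectrum, subset $A$ and value of $\epsilon$; indices are zero-based in the code.

```python
import math

def rel_entropy(p, s):
    return sum(pi * math.log(pi / si) for pi, si in zip(p, s) if pi > 0)

def d2(x, y):
    total = 0.0
    if x > 0:
        total += x * math.log(x / y)
    if x < 1:
        total += (1 - x) * math.log((1 - x) / (1 - y))
    return total

def extremal_spectrum(sigma, A, eps):
    """Spectrum of tau: rescale sigma on A and on its complement as in the proof."""
    P = sum(sigma[i] for i in A)
    return [(P + eps / 2) * s / P if i in A else (1 - P - eps / 2) * s / (1 - P)
            for i, s in enumerate(sigma)]

sigma = [0.5, 0.3, 0.2]  # hypothetical spectrum of sigma
A = {1}                  # hypothetical subset (zero-based index)
eps = 0.2
tau = extremal_spectrum(sigma, A, eps)
P = sum(sigma[i] for i in A)

print(sum(abs(t - s) for t, s in zip(tau, sigma)))  # trace distance to sigma
print(rel_entropy(tau, sigma), d2(P + eps / 2, P))  # the two sides agree
```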


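We define the function $\phi : [0, \frac{1}{2}] \to \mathbb{R}$ as

\[ \phi(p) = \frac{1}{1 - 2p} \log\left(\frac{1 - p}{p}\right) \tag{25} \]

extended continuously by $\phi(\frac{1}{2}) = 2$. Furthermore for any $\sigma \in \mathcal{D}_d$ we define

\[ \pi(\sigma) = \max_{A \subseteq \{1, \ldots, d\}} \min\left\{ \sum_{i \in A} s_i(\sigma),\ 1 - \sum_{i \in A} s_i(\sigma) \right\}. \tag{26} \]

With essentially the same proof as given in [?] for the classical case we obtain the
following improvement on Pinsker’s inequality:

Theorem 5.1 (State-dependent Pinsker’s Inequality). For $\sigma, \rho \in \mathcal{D}_d$ we have:

\[ D(\rho\|\sigma) \geq \frac{\phi(\pi(\sigma))}{4} \|\rho - \sigma\|_1^2 \tag{27} \]

with $\phi$ as in (25) and $\pi(\sigma)$ as in (26). Moreover, this inequality is tight.

Proof. For convenience set $\|\rho - \sigma\|_1 = \delta$. Then we have

\[ D(\rho\|\sigma) \geq \min_{\rho' : \|\rho' - \sigma\|_1 \geq \delta} D(\rho'\|\sigma) = \min_{A \subseteq \{1, \ldots, d\}} D_2\left(P_\sigma(A) + \frac{\delta}{2} \,\Big\|\, P_\sigma(A)\right) \tag{28} \]

using Lemma 5.1. By [?, Proposition 2.2] for $p \in [0, \frac{1}{2}]$ and $\epsilon \geq 0$ we have

\[ D_2(p + \epsilon\|p) \leq D_2(1 - p + \epsilon\|1 - p) \]

so we may assume $P_\sigma(A) \leq \frac{1}{2}$ in (28). In [13, Theorem 1] it is shown that for $p \in [0, \frac{1}{2}]$
we have

\[ \inf_{\epsilon \in (0, 1 - p]} \frac{D_2(p + \epsilon\|p)}{\epsilon^2} = \phi(p) \tag{29} \]

which implies:

\[ \min_{A \subseteq \{1, \ldots, d\}} D_2\left(P_\sigma(A) + \frac{\delta}{2} \,\Big\|\, P_\sigma(A)\right) \geq \min_{A \subseteq \{1, \ldots, d\}} \phi(P_\sigma(A))\, \frac{\|\rho - \sigma\|_1^2}{4}. \]

By [?, Proposition 2.4] the function $\phi$ is strictly decreasing. Thus, we have

\[ \min_{A \subseteq \{1, \ldots, d\}} \phi(P_\sigma(A)) = \phi(\pi(\sigma)) \]

which, after combining the previous inequalities, finishes the proof of (27). To show
that the inequality is tight, we may again follow the proof of [?, Proposition 2.1].
Let $B \subseteq \{1, \ldots, d\}$ be the subset such that $\pi(\sigma) = P_\sigma(B) =: p$. Define a minimizing
sequence $\{\epsilon_i\}_{i \in \mathbb{N}}$ with $\epsilon_i > 0$ for the infimum (with respect to $\epsilon$) in (29), i.e. such that

\[ \lim_{i \to \infty} \frac{D_2(p + \epsilon_i\|p)}{\epsilon_i^2} = \phi(p). \]

Next define a sequence of states $\rho_i$ that commute with $\sigma$ and have spectrum:

\[ s_j(\rho_i) = \begin{cases} \frac{(p + \epsilon_i)\, s_j(\sigma)}{p}, & \text{for } j \in B \\ \frac{(1 - p - \epsilon_i)\, s_j(\sigma)}{1 - p}, & \text{else.} \end{cases} \]

One can check that $\|\rho_i - \sigma\|_1 = 2\epsilon_i$ and $D(\rho_i\|\sigma) = D_2(p + \epsilon_i\|p)$, from which we get:

\[ \lim_{i \to \infty} \frac{D(\rho_i\|\sigma)}{\|\rho_i - \sigma\|_1^2} = \frac{\phi(\pi(\sigma))}{4}. \]

□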
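
For states commuting with $\sigma$ the inequality (27) reduces to a statement about probability vectors, which can be sampled directly. The sketch below assumes the normalization $\phi(\pi(\sigma))/4$ (so that $\phi(1/2) = 2$ recovers the usual Pinsker constant $1/2$), computes $\pi(\sigma)$ by brute force over subsets, and tests random spectra; the example spectrum and the sampling scheme are hypothetical.

```python
import itertools
import math
import random

def rel_entropy(p, s):
    return sum(pi * math.log(pi / si) for pi, si in zip(p, s) if pi > 0)

def phi(p):
    """phi from (25) on (0, 1/2], with the continuous extension phi(1/2) = 2."""
    if abs(p - 0.5) < 1e-9:
        return 2.0
    return math.log((1 - p) / p) / (1 - 2 * p)

def balance(sigma):
    """pi(sigma) from (26), by brute force over all subsets A."""
    best = 0.0
    for k in range(len(sigma) + 1):
        for A in itertools.combinations(range(len(sigma)), k):
            P = sum(sigma[i] for i in A)
            best = max(best, min(P, 1 - P))
    return best

sigma = [0.9, 0.05, 0.05]  # hypothetical spectrum with one dominant eigenvalue
constant = phi(balance(sigma)) / 4
print(balance(sigma), constant)  # the constant exceeds the Pinsker value 1/2

rng = random.Random(0)
violations = 0
for _ in range(1000):
    w = [rng.random() for _ in sigma]
    rho = [x / sum(w) for x in w]
    dist = sum(abs(a - b) for a, b in zip(rho, sigma))
    if rel_entropy(rho, sigma) < constant * dist ** 2 - 1e-12:
        violations += 1
print(violations)
```

The brute-force subset search is exponential in $d$ and is only meant for small examples.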


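In some cases the bound can be made more explicit, as illustrated in the next
corollary:

Corollary 5.1. Let $\sigma, \rho \in \mathcal{D}_d$ be such that $\|\sigma\|_\infty \geq \frac{1}{2}$. Then:

\[ D(\rho\|\sigma) \geq \frac{1}{4\,(2\|\sigma\|_\infty - 1)} \log\left(\frac{\|\sigma\|_\infty}{1 - \|\sigma\|_\infty}\right) \|\rho - \sigma\|_1^2. \tag{30} \]

Proof. In this case it is clear that $\pi(\sigma) = 1 - \|\sigma\|_\infty$.

□

Note that we have $\phi(x) \to +\infty$ for $x \to 0$. Thus, there might be an arbitrarily large
improvement of (27) compared to the usual Pinsker’s inequality (21). This happens
for instance in Corollary 5.1 when $\|\sigma\|_\infty \to 1$, i.e. when $\sigma$ converges to a pure state.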


By applying the improved inequality (27) to Theorem 4.1 we obtain for quantum
states $\rho, \sigma \in \mathcal{D}_d$ and $q \in [0,1]$

\[ S((1 - q)\sigma + q\rho) - (1 - q)S(\sigma) - qS(\rho) \geq \max\left\{ q\left(1 - q^{c(\sigma)}\right) \frac{\phi(\pi(\sigma))}{4} \|\rho - \sigma\|_1^2,\ (1 - q)\left(1 - (1 - q)^{c(\rho)}\right) \frac{\phi(\pi(\rho))}{4} \|\rho - \sigma\|_1^2 \right\} \]

with $\phi$ as in (25) and $\pi(\sigma)$ as in (26).

Even using this refinement of Pinsker’s inequality, some numerical experiments
indicate that (20) is stronger for randomly generated states. From Corollary 5.1 we can
expect our bound to perform well if $\sigma$ has a large eigenvalue and the smallest
eigenvalue is as large as possible. Such states have spectrum of the form $\left(p, \frac{1 - p}{d - 1}, \ldots, \frac{1 - p}{d - 1}\right)$.
Indeed for $\sigma \in \mathcal{D}_5$ with spectrum $(0.99, 0.0025, 0.0025, 0.0025, 0.0025)$ and $q \leq 0.2$ our
bound performs better than (19) for randomly generated $\rho$. However, even in this case
the improvement is not significant.

Still we can expect that Theorem 5.1 will find more applications, for instance
improving the mixing time bounds. Such bounds have been derived from log-Sobolev
inequalities in [5]. The next theorem can be used to improve these results:

Theorem 5.2. Let $\mathcal{L} : \mathcal{M}_d \to \mathcal{M}_d$ be a primitive Liouvillian with fixed point $\sigma$ that
satisfies

\[ D(e^{t\mathcal{L}}\rho\|\sigma) \leq e^{-2\alpha t} D(\rho\|\sigma) \tag{31} \]

for some $\alpha > 0$ and for all $\rho \in \mathcal{D}_d$ and $t \in \mathbb{R}^+$. Then we have

\[ \|e^{t\mathcal{L}}(\rho) - \sigma\|_1 \leq 2 e^{-\alpha t} \sqrt{\frac{\log(s_{\min}(\sigma)^{-1})}{\phi(\pi(\sigma))}} \tag{32} \]

with $\phi$ as in (25), $\pi(\sigma)$ as in (26) and where $s_{\min}(\sigma)$ is the smallest eigenvalue of $\sigma$.

Proof. This is a direct consequence of (31) and (27). □


6. Tensor products of depolarizing channels 

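For a Liouvillian $\mathcal{L} : \mathcal{M}_d \to \mathcal{M}_d$ generating the channel $T_t = e^{t\mathcal{L}}$ and any $n \in \mathbb{N}$ we
denote by $\mathcal{L}^{(n)} : \mathcal{M}_{d^n} \to \mathcal{M}_{d^n}$ the generator of the tensor-product semigroup $(T_t)^{\otimes n}$,
i.e.

\[ \mathcal{L}^{(n)} := \sum_{i=1}^n \mathrm{id}_d^{\otimes (i-1)} \otimes \mathcal{L} \otimes \mathrm{id}_d^{\otimes (n-i)}. \]

Here we study $\alpha_1(\mathcal{L}^{(n)})$ in the special case where $\sigma = \frac{\mathbb{1}_d}{d}$. For simplicity we denote
the depolarizing Liouvillian onto $\sigma = \frac{\mathbb{1}_d}{d}$ by $\mathcal{L}_d := \mathcal{L}_{\mathbb{1}_d/d}$ and by $T_t^d = e^{t\mathcal{L}_d}$ the generated
semigroup. In the case $d = 2$ it is known [?] that $\alpha_1(\mathcal{L}_2^{(n)}) = 1$ for any $n \in \mathbb{N}$. It
is, however, an open problem to determine this constant for any $d > 2$ and any $n \geq 2$.
We will now show the inequality $\alpha_1(\mathcal{L}_d^{(n)}) \geq \frac{1}{2}$ for any $d \geq 2$ and $n \geq 1$, which is
the best possible lower bound that is independent of the local dimension. Note that
for $\sigma = \frac{\mathbb{1}_d}{d}$ inequality (3) for the channel $(T_t^d)^{\otimes n}$ with $\alpha = \frac{1}{2}$ can be rewritten as the entropy
production inequality:

\[ S((T_t^d)^{\otimes n}(\rho)) \geq (1 - e^{-t})\, n \log(d) + e^{-t} S(\rho). \]

This inequality has been studied in [?] for the case where $d = 2$, for which, however,
an incorrect proof was given. We will provide a proof of a more general statement,
from which the claim $\alpha_1(\mathcal{L}_d^{(n)}) \geq \frac{1}{2}$ readily follows by the previous discussion.

Theorem 6.1. For any $\sigma \in \mathcal{D}_d$ and $\rho \in \mathcal{D}_{d^n}$ (not necessarily full rank) we have

\[ S((T_t^\sigma)^{\otimes n}(\rho)) \geq e^{-t} S(\rho) + (1 - e^{-t}) S(\sigma^{\otimes n}). \]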


For the proof we will need a special case of the quantum Shearer’s inequality. We will
denote by $\rho \in \mathcal{D}(\mathbb{C}^{d_1} \otimes \mathbb{C}^{d_2} \otimes \cdots \otimes \mathbb{C}^{d_n})$ a multipartite density matrix (where the $d_i$
are the local dimensions of each tensor factor). Furthermore we write $S(i_1, i_2, \ldots, i_k)_\rho$
for the entropy of the reduced density matrix $\rho$ on the tensor factors specified by the
indices $i_1, i_2, \ldots, i_k$. Similarly we write

\[ S(i_1, \ldots, i_k | j_1, \ldots, j_l)_\rho = S(i_1, \ldots, i_k, j_1, \ldots, j_l)_\rho - S(j_1, \ldots, j_l)_\rho \]

for a conditional entropy. The proof of the quantum version of Shearer’s inequality
is essentially the same as the proof given by Radhakrishnan and Llewellyn for the
classical version (see [?]). For convenience we provide the full proof:

Lemma 6.1 (Quantum Shearer’s inequality). Consider $t \in \mathbb{N}$ and a family $\mathcal{F} \subseteq
2^{\{1, \ldots, n\}}$ of subsets of $\{1, \ldots, n\}$ such that each $i \in \{1, \ldots, n\}$ is included in exactly $t$
elements of $\mathcal{F}$. Then for any $\rho \in \mathcal{D}(\mathbb{C}^{d_1} \otimes \mathbb{C}^{d_2} \otimes \cdots \otimes \mathbb{C}^{d_n})$ we have

\[ S(1, 2, \ldots, n)_\rho \leq \frac{1}{t} \sum_{F \in \mathcal{F}} S(F)_\rho. \tag{33} \]

Proof. For $F \subseteq \{1, \ldots, n\}$ denote its elements by $(i_1, \ldots, i_k)$, increasingly ordered.
For any $\rho \in \mathcal{D}(\mathbb{C}^{d_1} \otimes \mathbb{C}^{d_2} \otimes \cdots \otimes \mathbb{C}^{d_n})$ we have

\[ \sum_{j=1}^{|F|} S(i_j | i_1, \ldots, i_{j-1})_\rho = S(i_1)_\rho + S(i_2 | i_1)_\rho + \cdots + S(i_{|F|} | i_1, i_2, \ldots, i_{|F|-1})_\rho = S(i_1, i_2, \ldots, i_{|F|})_\rho = S(F)_\rho \]

where we used a telescopic sum trick. By strong subadditivity [?] conditioning
decreases the entropy. This implies

\[ \sum_{j=1}^{|F|} S(i_j | 1, 2, \ldots, i_j - 1)_\rho \leq \sum_{j=1}^{|F|} S(i_j | i_1, \ldots, i_{j-1})_\rho = S(F)_\rho. \tag{34} \]

Now consider a family $\mathcal{F} \subseteq 2^{\{1, \ldots, n\}}$ with the properties stated in the assumptions.
Using (34) for the first inequality gives:

\[ \sum_{F \in \mathcal{F}} S(F)_\rho \geq \sum_{F \in \mathcal{F}} \sum_{j=1}^{|F|} S(i_j | 1, 2, \ldots, i_j - 1)_\rho \tag{35} \]

\[ = t \sum_{i=1}^n S(i | 1, 2, \ldots, i - 1)_\rho = t\, S(1, 2, \ldots, n)_\rho. \tag{36} \]

Here we used the assumption that each $i \in \{1, \ldots, n\}$ is contained in exactly $t$ elements
of $\mathcal{F}$ and (34) in the special case of $F = \{1, \ldots, n\}$ for the final equality. □
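
As a sanity check of (33), the sketch below verifies the inequality for a classical joint distribution, i.e. for a density matrix that is diagonal in a product basis. The random distribution on three bits and the family of all two-element subsets (so $t = 2$) are hypothetical choices for this illustration; it does not exercise genuinely quantum states.

```python
import itertools
import math
import random

def entropy(dist):
    """Shannon entropy (in nats) of a dict mapping outcomes to probabilities."""
    return -sum(p * math.log(p) for p in dist.values() if p > 0)

def marginal(joint, indices):
    """Marginal distribution of the coordinates listed in `indices`."""
    out = {}
    for outcome, p in joint.items():
        key = tuple(outcome[i] for i in indices)
        out[key] = out.get(key, 0.0) + p
    return out

rng = random.Random(1)
outcomes = list(itertools.product((0, 1), repeat=3))
weights = [rng.random() for _ in outcomes]
total = sum(weights)
joint = {o: w / total for o, w in zip(outcomes, weights)}

family = list(itertools.combinations(range(3), 2))  # each index lies in exactly t = 2 subsets
t = 2
lhs = entropy(joint)
rhs = sum(entropy(marginal(joint, F)) for F in family) / t
print(lhs, rhs)
```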


Note that in the classical case Shearer’s inequality is true under the weaker
assumption that any $i \in \{1, \ldots, n\}$ is contained in at least $t$ elements of $\mathcal{F}$. However, as
the quantum conditional entropy might be negative [?] we have to use the stronger
assumption to get the equality between (35) and (36) where an $\geq$ would be enough.

In the special case where $\mathcal{F} = \mathcal{F}_k := \{F \subseteq \{1, \ldots, n\} : |F| = k\}$ denotes the family
of $k$-element subsets of $\{1, \ldots, n\}$ (i.e. every $i \in \{1, \ldots, n\}$ is contained in exactly $\binom{n-1}{k-1}$
elements of $\mathcal{F}_k$) the quantum Shearer inequality gives

\[ \frac{1}{n} S(1, \ldots, n) \leq \frac{1}{k \binom{n}{k}} \sum_{F \in \mathcal{F}_k} S(F). \tag{37} \]

This inequality was also proved in [18], but in a more complicated way and without
mentioning the more general quantum Shearer’s inequality. It is also used as a lemma
(with wrong proof) in [2], where the rest of the proof of their entropy production
estimate is correct. The proof of Theorem 6.1 follows the same lines. For completeness
we will include the full proof here:

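Proof of Theorem 6.1. In the following we will abbreviate $p := e^{-t}$. For a subset
$F \subseteq \{1, \ldots, n\}$ we denote by $\rho|_F$ the reduced density matrix on the tensor factors
specified by $F$. Using this notation we can write

\[ (T_t^\sigma)^{\otimes n}(\rho) = \sum_{k=0}^n \sum_{F \in \mathcal{F}_k} (1 - p)^k p^{n-k} \left(\bigotimes_{i \in F} \sigma\right) \otimes \rho|_{F^c} \]

where $F^c = \{1, \ldots, n\} \setminus F$. Concavity of the von-Neumann entropy implies

\[ S((T_t^\sigma)^{\otimes n}(\rho)) \geq \sum_{k=0}^n \sum_{F \in \mathcal{F}_k} (1 - p)^k p^{n-k} \left(k S(\sigma) + S(F^c)_\rho\right) \]

\[ \geq (1 - p)\, n S(\sigma) + \sum_{k=0}^n \binom{n}{k} \frac{n - k}{n} (1 - p)^k p^{n-k} S(\rho) \]

\[ = (1 - p) S(\sigma^{\otimes n}) + p S(\rho). \]

Here we used the elementary identity $\sum_{k=0}^n k \binom{n}{k} (1 - p)^k p^{n-k} = (1 - p)n$ and (37) for
the $(n - k)$-element subsets $F^c$.

□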
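
The two binomial sums used in the last step of the proof can be confirmed numerically. The sketch below evaluates both sums for hypothetical values of $n$ and $p$; the first should equal $(1 - p)n$ and the second should equal $p$.

```python
import math

def binomial_sums(n, p):
    """Return the two sums appearing in the proof of Theorem 6.1."""
    first = sum(k * math.comb(n, k) * (1 - p) ** k * p ** (n - k)
                for k in range(n + 1))
    second = sum(math.comb(n, k) * (n - k) / n * (1 - p) ** k * p ** (n - k)
                 for k in range(n + 1))
    return first, second

n, p = 7, 0.3  # hypothetical example values
first, second = binomial_sums(n, p)
print(first, (1 - p) * n)
print(second, p)
```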
A. Quasi-concavity of a quotient of relative entropies

In this appendix we will prove the quasi-concavity of the function $y \mapsto q_y(x)$ for any
$x \in (0,1)$. As defined in (8) the function $q_y : (0,1) \to \mathbb{R}$ denotes the continuous
extension of $x \mapsto \frac{D_2(y\|x)}{D_2(x\|y)}$. In the following we consider $f_x : [0,1] \to \mathbb{R}$ defined as
$f_x(y) = q_y(x)$ for any $y \in (0,1)$ and with $f_x(0) = f_x(1) = 0$. It can be checked easily
that $f_x$ is continuous for any $x \in (0,1)$. We have the following Lemma:

Lemma A.1. For any $x \in (0,1)$ the function $f_x : [0,1] \to \mathbb{R}$ given by $f_x(y) = \frac{D_2(y\|x)}{D_2(x\|y)}$
for $y \notin \{0, x, 1\}$ and extended continuously by $f_x(x) = 1$ and $f_x(0) = f_x(1) = 0$ is
quasi-concave.

Proof. Note that without loss of generality we can assume $x \geq \frac{1}{2}$, as $f_x(y) = f_{1-x}(1 - y)$.
By continuity it is clear that there exists an $m_f \in (0,1)$ (we can exclude the
boundary points since $f_x(x) > f_x(0) = f_x(1)$) such that $f_x(m_f)$ is the global maximum.
By [?, p. 99] it is sufficient to show that $f_x$ is unimodal, i.e. that $f_x$ is monotonically
increasing on $[0, m_f)$ and monotonically decreasing on $(m_f, 1]$. We will use the method
of L’Hospital type rules for monotonicity developed in [?].

For any x G (0,1) and y G (0,1) with x ^ y we compute 


dyD 2 {y\\x) = log 


dy\og 


f y(^-x) 

\x(l-y) 


y(^-x) \ 

x(l-y)J 

1 

y(i - y) 


dyD 2 {x\\y) 


y-x 

2/(1 - y) 


_ y-x 

2/(1 - y) 

+ X — 2yx 

(1 _ yYy2 


and define 


9x{y) 

hx{y) 


dyD 2 {y\\x) 
dyD 2 ix\\y) 



xp-v) 

y{l-x) 


5, 


y y{i-y) 


log(f{^) 2/(1-2/) 

x-y 

^ ^ 2/(1-y) 

y^ X — 2yx 


(38) 

(39) 


where again $g_x$ is extended continuously by $g_x(0) = g_x(1) = 0$ and $g_x(x) = 1$. As
$y \mapsto y^2 + x - 2yx$ has no real zeros for $x \in (0,1)$, the rational function $h_x$ is continuously
differentiable on $(0,1)$. A straightforward calculation reveals that for $x \geq \frac{1}{2}$ and on
$(0,1)$ the derivative $h_x'$ only vanishes in

\[
m_h = \begin{cases}
\frac{x - \sqrt{x(1-x)}}{2x-1} & \text{for } x > \frac{1}{2}, \\
\frac{1}{2} & \text{for } x = \frac{1}{2},
\end{cases}
\]

which has to be a maximum as $h_x(0) = h_x(1) = 0$ and $h_x > 0$ on $(0,1)$. By the lack of further points with
vanishing derivative we have $h_x'(y) > 0$ for any $y < m_h$ and also $h_x'(y) < 0$ for any
$y > m_h$. Note that $m_h < x$ for any $x > \frac{1}{2}$.
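The location of the critical point of $h_x$ can be checked with a few lines of code. This is a minimal sketch, assuming the closed form $m_h = \frac{x - \sqrt{x(1-x)}}{2x-1}$ (the root in $(0,1)$ of the numerator $(2x-1)y^2 - 2xy + x$ of $h_x'$); the function names and the finite-difference step are illustrative assumptions.

```python
import math

def h(x, y):
    """h_x(y) = y(1-y) / (y^2 + x - 2yx), the derivative ratio from (39)."""
    return y * (1.0 - y) / (y * y + x - 2.0 * y * x)

def m_h(x):
    """Assumed maximiser of h_x for x >= 1/2."""
    if x == 0.5:
        return 0.5
    return (x - math.sqrt(x * (1.0 - x))) / (2.0 * x - 1.0)

def h_prime(x, y, step=1e-6):
    """Central finite-difference approximation of the derivative of h_x at y."""
    return (h(x, y + step) - h(x, y - step)) / (2.0 * step)

for x in (0.5, 0.6, 0.75, 0.9):
    m = m_h(x)
    # The derivative should vanish at m, be positive to the left and negative to the right.
    print(x, m, h_prime(x, m), h_prime(x, 0.5 * m), h_prime(x, 0.5 * (m + 1.0)))
```

The sign pattern of the finite differences reflects that $h_x$ increases on $(0, m_h)$ and decreases on $(m_h, 1)$, which is the input needed for the monotonicity rules used next.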

Consider first the interval $(x,1) \subset (0,1)$. For $y \to x$ we have $\log\left(\frac{x(1-y)}{y(1-x)}\right) \to 0$ and
$\frac{x-y}{y(1-y)} \to 0$. Also it is clear that $y \mapsto \frac{x-y}{y(1-y)}$ does not change sign on the interval $(x,1)$.
Therefore and by (39) we see that the pair $g_x$ and $h_x$ satisfy the assumptions of [?,
Proposition 1.1] and as $h_x$ is decreasing we have that $g_x'(y) < 0$ for any $y \in (x,1)$.
We can use the same argument for the (possibly empty) interval $(m_h, x)$, where $h_x$ is
decreasing as well, and obtain $g_x'(y) < 0$ for any $y \in (m_h, x)$. By continuity of $g_x$ in $x$
we see that $g_x$ is decreasing on $(m_h, 1)$.

Note that in the case where $x = \frac{1}{2}$ we can directly apply [?, Proposition 1.1] to
the remaining interval $(0, \frac{1}{2})$, where $h_{1/2}$ is increasing. This proves $g_{1/2}'(y) > 0$ for
any $y \in (0, \frac{1}{2})$. By continuity $m_g = \frac{1}{2}$ is the maximum point of $g_{1/2}$. For $x > \frac{1}{2}$,
where the remaining interval is $(0, m_h)$, we apply the more general Proposition 2.1
in [?]. It can be checked easily that the assumptions of this proposition are fulfilled
for the pair $g_x$ and $h_x$. As for $y \in (0, m_h)$ we have $\partial_y D_2(x\|y)\, \partial_y^2 D_2(x\|y) < 0$ and as $h_x$ is
increasing, the proposition shows that $g_x'(y) > 0$ for any $y \in (0, m_g)$ and $g_x'(y) < 0$ for
any $y \in (m_g, m_h)$. Here $m_g \in (0, m_h)$ denotes the maximum point of $g_x$ (note that a
maximum $m_g$ has to exist due to continuity and $g_x(0) = g_x(1) = 0$).

The previous argument shows that for any $x \geq \frac{1}{2}$ there exists a point $m_g \in (0, m_h] \subset
(0, x]$ (we have $m_g = m_h = \frac{1}{2}$ for $x = \frac{1}{2}$) such that $g_x'(y) > 0$ for $y \in (0, m_g)$ and
$g_x'(y) < 0$ for $y \in (m_g, 1) \setminus \{x\}$. We can now repeat the above argument for the pair
$f_x$ and $g_x$. This gives the existence of a point $m_f \in (0, m_g]$ such that $f_x'(y) > 0$ for
any $y \in (0, m_f)$ and $f_x'(y) < 0$ for any $y \in (m_f, 1) \setminus \{x\}$. By continuity in $x$ this shows
that the function $f_x$ is unimodal and therefore quasi-concave.
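The chain of maximisers $m_f \leq m_g \leq m_h \leq x$ obtained in the argument above can be illustrated numerically. The sketch below locates the maximisers of $f_x$, $g_x$ and $h_x$ on a grid; the function names, the grid size `N` and the chosen values of $x$ are assumptions made for illustration, and the grid only resolves the maximisers up to the grid spacing.

```python
import math

def d2(p, q):
    """Binary relative entropy D_2(p||q) for p, q in (0,1)."""
    return p * math.log(p / q) + (1.0 - p) * math.log((1.0 - p) / (1.0 - q))

def f(x, y):
    """f_x(y) = D_2(y||x) / D_2(x||y), extended by 0 at the boundary and 1 at y = x."""
    if y <= 0.0 or y >= 1.0:
        return 0.0
    if abs(y - x) < 1e-9:
        return 1.0
    return d2(y, x) / d2(x, y)

def g(x, y):
    """g_x(y) from (38), extended by 0 at the boundary and 1 at y = x."""
    if y <= 0.0 or y >= 1.0:
        return 0.0
    if abs(y - x) < 1e-9:
        return 1.0
    return math.log(x * (1.0 - y) / (y * (1.0 - x))) * y * (1.0 - y) / (x - y)

def h(x, y):
    """h_x(y) from (39)."""
    return y * (1.0 - y) / (y * y + x - 2.0 * y * x)

N = 4000  # hypothetical grid resolution

def maximisers(x):
    """Grid approximations of (m_f, m_g, m_h) for a given x >= 1/2."""
    grid = [k / N for k in range(N + 1)]
    return tuple(max(grid, key=lambda y: fn(x, y)) for fn in (f, g, h))

for x in (0.5, 0.6, 0.8, 0.95):
    m_f, m_g, m_h = maximisers(x)
    print(f"x={x}: m_f~{m_f:.4f}, m_g~{m_g:.4f}, m_h~{m_h:.4f}")
```

Each application of the monotonicity rule can only move the maximiser to the left, which is why the three grid maximisers should appear in non-decreasing order and stay below $x$.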

□ 


B. Continuous extension of a quotient of relative entropies

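In this section we show that the quotient $\rho \mapsto \frac{D(\sigma\|\rho)}{D(\rho\|\sigma)}$, extended by the value $1$ at
$\rho = \sigma$ as in the main text, is indeed continuous. As this quotient is clearly continuous in any
point $\rho \neq \sigma$ we have to prove the following: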

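Lemma B.1. For $\sigma \in \mathcal{D}_d^+$ and $X \in \mathcal{M}_d$ with $X = X^\dagger$, $\mathrm{tr}[X] = 0$ and $X \neq 0$ we have

\[
\lim_{\epsilon \to 0} \frac{D(\sigma \| \sigma + \epsilon X)}{D(\sigma + \epsilon X \| \sigma)} = 1.
\]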
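Before turning to the proof, the limit in Lemma B.1 can be observed numerically for a non-commuting example. This is a sketch under assumptions: the matrices `SIGMA` and `X` are arbitrary illustrative choices (a full-rank real symmetric state and a traceless symmetric perturbation), and the matrix logarithm is computed with a closed-form spectral decomposition that is only valid for $2 \times 2$ symmetric matrices with distinct eigenvalues.

```python
import math

def mat_add(a, b, scale=1.0):
    """Return a + scale * b for 2x2 matrices stored as nested lists."""
    return [[a[i][j] + scale * b[i][j] for j in range(2)] for i in range(2)]

def mat_mul(a, b):
    """Product of two 2x2 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def sym_log(m):
    """Logarithm of a positive definite real symmetric 2x2 matrix with distinct
    eigenvalues l1 > l2, using log(m) = log(l1) P1 + log(l2) P2."""
    mean = 0.5 * (m[0][0] + m[1][1])
    radius = math.hypot(0.5 * (m[0][0] - m[1][1]), m[0][1])
    l1, l2 = mean + radius, mean - radius
    eye = [[1.0, 0.0], [0.0, 1.0]]
    p1 = [[(m[i][j] - l2 * eye[i][j]) / (l1 - l2) for j in range(2)] for i in range(2)]
    p2 = [[(l1 * eye[i][j] - m[i][j]) / (l1 - l2) for j in range(2)] for i in range(2)]
    return [[math.log(l1) * p1[i][j] + math.log(l2) * p2[i][j] for j in range(2)]
            for i in range(2)]

def rel_ent(rho, sigma):
    """D(rho||sigma) = tr[rho (log rho - log sigma)] for full-rank 2x2 states."""
    diff = mat_add(sym_log(rho), sym_log(sigma), scale=-1.0)
    prod = mat_mul(rho, diff)
    return prod[0][0] + prod[1][1]

SIGMA = [[0.7, 0.1], [0.1, 0.3]]   # hypothetical full-rank state
X = [[0.2, 0.3], [0.3, -0.2]]      # hypothetical traceless symmetric perturbation

def ratio(eps):
    """D(sigma || sigma + eps X) / D(sigma + eps X || sigma)."""
    perturbed = mat_add(SIGMA, X, scale=eps)
    return rel_ent(SIGMA, perturbed) / rel_ent(perturbed, SIGMA)

for eps in (1e-1, 1e-2, 1e-3):
    print(eps, ratio(eps))
```

Both relative entropies are of order $\epsilon^2$, so the quotient stays well-conditioned for moderately small $\epsilon$; for very small $\epsilon$ floating-point cancellation would eventually dominate.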

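Proof. To show the claim we will expand the relative entropy in terms of $\epsilon$ up to second
order. Observe that for $\rho \in \mathcal{D}_d^+$ we have

\[
D(\rho\|\sigma) = \int_0^\infty \mathrm{tr}\left[\rho\left((\sigma + t)^{-1} - (\rho + t)^{-1}\right)\right] dt. \tag{40}
\]

In the following we assume $\epsilon > 0$ to be small enough such that $\sigma + \epsilon X \in \mathcal{D}_d^+$. To
simplify the notation, we introduce $A(t) := (\sigma + t)^{-1}$ and $B(t) := (\sigma + \epsilon X + t)^{-1}$.
Applying the recursive relation

\[
B(t) = -\epsilon B(t) X A(t) + A(t)
\]

twice leads to

\[
\begin{aligned}
B(t) - A(t) = -\epsilon B(t) X A(t) &= \epsilon^2 B(t) X A(t) X A(t) - \epsilon A(t) X A(t) \\
&= \epsilon^2 A(t) X A(t) X A(t) - \epsilon A(t) X A(t) + O(\epsilon^3).
\end{aligned}
\]

Inserting this into (40) gives

\[
D(\sigma \| \sigma + \epsilon X) = \int_0^\infty \mathrm{tr}\left[-\epsilon \sigma A(t) X A(t) + \epsilon^2 \sigma A(t) X A(t) X A(t) + O(\epsilon^3)\right] dt \tag{41}
\]

and

\[
D(\sigma + \epsilon X \| \sigma) = \int_0^\infty \mathrm{tr}\left[\epsilon \sigma A(t) X A(t) - \epsilon^2 \sigma A(t) X A(t) X A(t) + \epsilon^2 X A(t) X A(t) + O(\epsilon^3)\right] dt. \tag{42}
\]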
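The second-order resolvent expansion $B(t) = A(t) - \epsilon A(t) X A(t) + \epsilon^2 A(t) X A(t) X A(t) + O(\epsilon^3)$ used above can be checked numerically. The sketch below is illustrative only: `SIGMA`, `X` and the value of `t` are hypothetical choices, and the error is measured in the maximum entry norm.

```python
def mat_add(a, b, scale=1.0):
    """Return a + scale * b for 2x2 matrices stored as nested lists."""
    return [[a[i][j] + scale * b[i][j] for j in range(2)] for i in range(2)]

def mat_mul(a, b):
    """Product of two 2x2 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def inv2(m):
    """Inverse of an invertible 2x2 matrix."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return [[m[1][1] / det, -m[0][1] / det], [-m[1][0] / det, m[0][0] / det]]

SIGMA = [[0.7, 0.1], [0.1, 0.3]]   # hypothetical full-rank state
X = [[0.2, 0.3], [0.3, -0.2]]      # hypothetical traceless symmetric perturbation

def expansion_error(eps, t=1.0):
    """Max-entry distance between B(t) and its second-order expansion in eps."""
    shift = [[t, 0.0], [0.0, t]]
    a = inv2(mat_add(SIGMA, shift))                          # A(t) = (sigma + t)^{-1}
    b = inv2(mat_add(mat_add(SIGMA, X, scale=eps), shift))   # B(t) = (sigma + eps X + t)^{-1}
    axa = mat_mul(mat_mul(a, X), a)
    axaxa = mat_mul(mat_mul(axa, X), a)
    approx = mat_add(mat_add(a, axa, scale=-eps), axaxa, scale=eps ** 2)
    return max(abs(b[i][j] - approx[i][j]) for i in range(2) for j in range(2))

for eps in (1e-2, 5e-3, 2.5e-3):
    print(eps, expansion_error(eps))
```

Halving $\epsilon$ should reduce the error by roughly a factor of $8$, which is the behaviour expected of an $O(\epsilon^3)$ remainder.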


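As $[A(t), \sigma] = 0$ we can diagonalize these operators in the same orthonormal basis
$\{|i\rangle\}_{i=1}^d \subset \mathbb{C}^d$, which leads to

\[
\int_0^\infty \mathrm{tr}\left[\sigma A(t) X A(t)\right] dt
= \sum_{i=1}^d \langle i|X|i\rangle \int_0^\infty \frac{s_i}{(s_i + t)^2}\, dt
= \sum_{i=1}^d \langle i|X|i\rangle = 0, \tag{43}
\]

where $\{s_i\}_{i=1}^d$ denotes the spectrum of $\sigma$. Note that again by diagonalizing $\sigma$ and $A(t)$
in the same basis we have

\[
\int_0^\infty \mathrm{tr}\left[(2\sigma A(t) - \mathbb{1})\,(X A(t) X A(t))\right] dt
= \sum_{i,j=1}^d |\langle i|X|j\rangle|^2 \int_0^\infty \left(\frac{2 s_i}{(s_i + t)^2 (s_j + t)} - \frac{1}{(s_i + t)(s_j + t)}\right) dt \tag{44}
\]

\[
= \sum_{\substack{i,j=1 \\ s_i \neq s_j}}^d |\langle i|X|j\rangle|^2\, \frac{(s_i + s_j) \log\left(\frac{s_i}{s_j}\right) - 2(s_i - s_j)}{(s_i - s_j)^2} = 0. \tag{45}
\]

The terms with $s_i = s_j$ are omitted in (45) since the corresponding integrals in (44) vanish.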


The last equality follows from the fact that the expression in (45) changes its
sign when $s_i$ and $s_j$ are exchanged, while $|\langle i|X|j\rangle|^2$ is symmetric in $i$ and $j$. This is only
possible if the value of the integral (44) vanishes. Rearranging the integral (44) gives:

\[
\int_0^\infty \mathrm{tr}\left[\sigma A(t) X A(t) X A(t)\right] dt
= \int_0^\infty \mathrm{tr}\left[-\sigma A(t) X A(t) X A(t) + X A(t) X A(t)\right] dt. \tag{46}
\]

Finally, applying (43) and (46) to the formulas for the relative entropies (41) and (42) gives:

\[
\frac{D(\sigma \| \sigma + \epsilon X)}{D(\sigma + \epsilon X \| \sigma)} = \frac{c + O(\epsilon)}{c + O(\epsilon)} \to 1
\]

as $\epsilon \to 0$. Here $c := \int_0^\infty \mathrm{tr}\left[\sigma A(t) X A(t) X A(t)\right] dt > 0$ as $\sigma, A(t) > 0$ for any $t \in [0, \infty)$
and $X \neq 0$ is Hermitian.
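The scalar integral behind (44) and (45) can be checked by numerical quadrature. The sketch below assumes the closed form $\frac{(a+b)\log(a/b) - 2(a-b)}{(a-b)^2}$ for a pair of distinct eigenvalues $a = s_i$, $b = s_j$; the function names, the substitution $t = u/(1-u)$ and the number of quadrature nodes are illustrative choices.

```python
import math

def integrand(t, a, b):
    """Integrand of (44) for one pair of eigenvalues a = s_i, b = s_j."""
    return 2.0 * a / ((a + t) ** 2 * (b + t)) - 1.0 / ((a + t) * (b + t))

def integral(a, b, n=100000):
    """Midpoint rule on [0, infinity) after the substitution t = u / (1 - u)."""
    total = 0.0
    for k in range(n):
        u = (k + 0.5) / n
        t = u / (1.0 - u)
        total += integrand(t, a, b) / (1.0 - u) ** 2  # dt = du / (1 - u)^2
    return total / n

def closed_form(a, b):
    """Assumed closed form of the integral for a != b."""
    return ((a + b) * math.log(a / b) - 2.0 * (a - b)) / (a - b) ** 2

for a, b in ((0.7, 0.3), (0.3, 0.7), (0.6, 0.4)):
    print(a, b, integral(a, b), closed_form(a, b))
print(integral(0.5, 0.5))  # equal eigenvalues: the integral should be close to zero
```

The antisymmetry under exchanging $a$ and $b$ is what makes the double sum over $i, j$ vanish, and the equal-eigenvalue case confirms that degenerate pairs contribute nothing.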

□